Performance Analysis of Voice Activity Detection Algorithms for Robust Speech Recognition

نویسنده

  • C.Ganesh Babu
چکیده

The emerging applications of speech technology especially in the fields of wireless applications, digital hearing aids or speech recognition are often requiring a noise reduction technique in combination with a precise Voice Activity Detector (VAD). In this paper, we compare the performance of the VAD algorithms like Zero Crossing Detection(ZCD), Weak Fricative Detection (WFD), Pitch Based Detection (PBD), Energy Based Detection (EBD) and Subband Order Statistics Filter (OSF) in presence of different types of noise like airport, babble, train, car, street, exhibition, restaurant and leopard for Automatic Speech Recognition (ASR). When analysis was done under various noise conditions for speech recognition, it was found that Subband Order statistics Filter (OSF) method algorithm performs better than other VAD algorithms.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Algorithm for Voice Activity Detection Based on Wavelet Packets (RESEARCH NOTE)

Speech constitutes much of the communicated information; most other perceived audio signals do not carry nearly as much information. Indeed, much of the non-speech signals maybe classified as ‘noise’ in human communication. The process of separating conversational speech and noise is termed voice activity detection (VAD). This paper describes a new approach to VAD which is based on the Wavelet ...

متن کامل

Advanced front-end for robust speech recognition in extremely adverse environments

In this paper, a unified approach to speech enhancement, feature extraction and feature normalization for speech recognition in adverse recording conditions is presented. The proposed frontend system consists of several different, independent, processing modules. Each of the algorithms contained in these modules has been independently applied to the problem of speech recognition in noise, signi...

متن کامل

Supervised/Unsupervised Voice Activity Detectors for Text- dependent Speaker Recognition on the RSR2015 Corpus

Voice activity detection, i.e., discrimination of the speech/nonspeech segments in a speech signal, is an important enabling technology for a variety of speech-based applications including the speaker recognition. In this work we provide a performance evaluation of the following supervised and unsupervised VAD algorithms in the context of text-dependent speaker recognition on the RSR2015 (Robus...

متن کامل

Bispectra Analysis-Based VAD for Robust Speech Recognition

A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order autocumulants. This algorithm differs from many others in the way the decisi...

متن کامل

Statistical Tests for Voice Activity Detection

A robust and effective voice activity detection (VAD) algorithm is proposed for improving speech recognition performance in noisy environments. The approach is based on filtering the input channel to avoid high energy noisy components and then the determination of the speech/non-speech bispectra by means of third order autocumulants. This algorithm differs from many others in the way the decisi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011